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(54) Computer based system and method of determining a satisfaction index of a text 



(57) The invention relates to a computer based 
method and system for automatically determining in a 
document an object satisfaction score (44), choosing at 
least one of either a positive score or a negative score, 
comprising the steps of: subdividing the document into 
windows of words; selecting one of said windows when 
it contains at least one descriptive property correspond- 



ing to a key subject of said object and/or when it does 
not contain any descriptive property corresponding to 
an anti-key subject of said object; processing each se- 
lected window to determine a positive or negative score 
of said window by counting the occurrences of positive 
and negative concepts; and cumulating the scores of the 
selected windows to obtain said object satisfaction 
score for the document. 
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Description 

Field of the Invention 

[0001] The present invention relates to computer 5 
based systems and methods of selecting and analyzing 
candidate documents containing specific contents or 
subject matter. More precisely, the invention relates to 
an automatic system for and method of processing doc- 
uments to evaluate the global opinion or satisfaction to 
about an object such as a product, a service, a brand, 
an event or similar. For example, the documents which 
are found in the World Wide Web. 

Background 15 

[0002] A technical problem exists in evaluating appar- 
ent trends in the perception of people regarding prod- 
ucts, services, brands, events, etc., in a global commu- 
nity, where many, if not most, of such opinions are found 20 
in documents and files available on the Word Wide Web. 
[0003] There is a need for an automated, computa- 
tionally efficient, method and system foranalyzing this 
plethora of documents and files, and to efficiently gen- 
erate information which would indicate perceptions 25 
(positive or negative) of people regarding such products 
and services, by geographical or by another similar seg- 
mented manner. 

[0004] For clarity, in the following description, "ob- 
jects" means products, services, brands, events, etc. 30 
that can be identified by words in a document and on 
which one wants to have a global satisfaction evalua- 
tion. 

[0005] On the web, numerous documents are pub- 
lished each day, and numerous opinions are expressed 35 
in different ways, for example, by setting up web pages, 
by exchanging information in newsgroups and discuss- 
ing subjects in chat-rooms. These documents contain 
valuable information about the perception of objects. 
[0006] Capturing or monitoring the perception of ob- 40 
jects on the web or any other network allows evaluating 
the positive and/or negative (and/or opinionless) per- 
ception of the users/customers. More generally, percep- 
tion analysis allows a corporation, institution or the like 
to know the opinion of actual or potential users/custom- 45 
ers on their products, services, etc. This satisfaction 
opinion can then be used for modifying/improving the 
object. 

[0007] Known evaluation methods providing satisfac- 
tion indexes use document search engines to perform 50 
keyword searches for publications on the web. Such 
search engines are able to find on the web all docu- 
ments containing one or more keywords and to down- 
load documents related to a specific topic. Then, the ex- 
amination and evaluation of the satisfaction index (pos- 55 
itive, negative, opinionless) of each document are either 
done manually, i.e. by readers extracting the global im- 
pression of the document, or automatically with artificial 



neural networks which are very complex, especially to 
configure for a new domain. The complexity of the neu- 
ral networks analysis is prejudicial to the speed of 
processing a voluminous data base taking into account 
the configuration time. 

[0008] An approach which comes in mind to one who 
will attempt to automatically evaluate a satisfaction in- 
dex is to use a classification algorithm. Such a classifi- 
cation algorithm consists in a statistical text learning al- 
gorithm which can be trained to approximately classify 
documents, given a sufficient set of labeled training ex- 
amples. Classification algorithms are already used to 
automatically catalog news articles, sort e-mail or learn 
the reading interest of users, and one might think that 
such algorithms could be trained to distinguish between 
positive and negative documents. However, not only 
they require a large, often prohibitive, number of labeled 
documents (i.e. hand classified) but they do not adapt 
well to understanding the context of specific words (for 
example positive references to a given product). 
[0009] A purpose of the present invention is to over- 
come at least one disadvantage of the known solutions 
for automatically determining a satisfaction index about 
an object. 

[001 0] Another purpose of the present invention is to 
provide a computer based apparatus and method of se- 
lecting and analyzing candidate documents in order to 
determine a satisfaction index about a predetermined 
object, which leads to a very simple and fast software 
algorithm. 

[001 1] A further purpose of the present invention is to 
make a discrimination between appreciation and de- 
scription of the object. 

Brief Summary of the Invention 

[0012] To attain the above purposes and others, the 
present invention provides a computer based method 
for automatically determining in a document an object 
satisfaction score, choosing at least one of either a pos- 
itive score or a negative score, of an object such as a 
product, service, brand, event or similar, comprising the 
steps of: 

subdividing the document into windows of words; 
selecting one of said windows when it contains at 
least one descriptive property corresponding to a 
key subject of said object and/or when it does not 
contain any descriptive property corresponding to 
an anti-key subject of said object; 
processing each selected window to determine a 
positive or negative score of said window by count- 
ing the occurrences of positive and negative con- 
cepts; and 

cumulating the scores of the selected windows to 
obtain said object satisfaction score for the docu- 
ment. 
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[001 3] According to an embodiment of the present in- 
vention, the method further comprises the steps of: 

storing in a computer memory a list of positive con- 
cepts concerning said object; 
storing in said computer memory a list of negative 
concepts concerning said object; 
storing in said computer memory a list of descriptive 
properties corresponding to key subjects of said ob- 
ject; and 

storing in said computer memory a list of descriptive 
properties corresponding to anti-key subjects of 
said object. 

[0014] According to an embodiment of the present in- 
vention, the method further comprises the steps of: 

counting the number of occurrences of negation 
particles in the current selected window; and 
inverting the positive and negative scores of the cur- 
rent window if said number is odd. 

[001 5] According to an embodiment of the present in- 
vention, said windows of words are sentences or seg- 
ments of sentence. 

[001 6] According to an embodiment of the present in- 
. vention, the positive or negative score of a previous sen- 
tence is subtracted from the document corresponding 
score when a word considered as a sentence separator 
and able to constitute a negator of the previous sen- 
. tence appears in the current sentence. 
[001 7] According to an embodiment of the present in- 
vention, the method also comprises the step of storing 
in said computer memory a list of previous negators. 
[0018] The present invention also provides a compu- 
ter based method for automatically determining an ob- 
ject satisfaction index of an object such as a product, 
service, brand, event or similar, among documents con- 
tained in at least one local or remote memory storage 
comprising the steps of: 

selecting from the memory storage documents re- 
lating to said object; 

processing the selected documents to determine 
their object satisfaction scores; and 
cumulating the scores of the selected documents 
and deducing from obtained cumulated scores said 
object satisfaction index. 

[001 9] According to an embodiment of the present in- 
vention, the selection of a document is based on the oc- 
currences of said key subjects and/or anti-key subjects 
counted during said processing step of each window. 
[0020] According to an embodiment of the present in- 
vention, a document score is added to the correspond- 
ing cumulated score only if the absolute value of the dif- 
ference between the positive and negative scores of this 
document is higher than a predetermined satisfaction 



threshold, or is otherwise considered as opinionless. 
[0021] According to an embodiment of the present in- 
vention, the method further comprises counting the 
number of occurrences of competitor subjects in each 

5 document and wherein a document score is added to 
the corresponding cumulated score only if said number 
of competitor subject occurrences is lower than a pre- 
determined comparison threshold. 
[0022] According to an embodiment of the present in- 

10 vention, the competitor subjects are part of the anti-key 
subjects and are counted during said processing step 
of each selected window. 

[0023] The present invention also provides a compu- 
ter based system for automatically determining an ob- 
15 ject satisfaction index for an object such as a product, 
service, brand, event or similar comprising: 

a general purpose computer having at least an out- 
put device, at least an input device, at least a com- 
20 munication device for communicating with at least 
one local and/or remote document memory storage, 
and a central processing unit, 
said central processing unit comprising: 

25 - at least a computer memory for storing at least 
a first list of positive concepts concerning said 
object, a second list of negative concepts con- 
cerning said object, and a third list of descrip- 
tive properties of said object, 

30 a counter for counting, in windows of words of 

a selected document, the occurrences of posi- 
tive and/or negative concepts, for cumulating 
the scores of said windows to obtain a satisfac- 
tion score of said object in the document, a win- 

35 dow being taken into account in said satisfac- 

tion score only if it contains at least one descrip- 
tive property corresponding to a key subject of 
said object and/or if it does not contain any de- 
scriptive property corresponding to an anti-key 

40 subject of said object. 

[0024] According to an embodiment of the present in- 
vention, said counter cumulates the satisfaction scores 
of selected documents of the memory storage to deter- 
45 mine a satisfaction index of said object. 

[0025] According to an embodiment of the present in- 
vention, the system further comprises a search engine 
for selecting documents on the basis of said descriptive 
properties. 

so [0026] According to an embodiment of the present in- 
vention, said counter also counts the occurrences of 
said descriptive properties. 

[0027] According to an embodiment of the present in- 
vention, the system further comprises a data table to 
55 store configuration choices. 

[0028] According to an embodiment of the present in- 
vention, the system further comprises a list of previous 
negators so that, when such a previous negator appears 
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in a current window, the positive or negative score of the 
previous window is disregarded. 
[0029] According to an embodiment of the present in- 
vention, the system further comprises tools to review the 
objects description properties and/or the positive and 
negative concepts and/or the document satisfaction 
scores, and/or the base document or meta data. 
[0030] The present invention also provides an auto- 
matic system for determining a satisfaction index of an 
object such as a product, service, brand, event or simi- 
lar, among a group of documents contained in at least 
one local or remote memory storage. 
[0031] The present invention also provides a compu- 
ter program product. 

Description of the Drawings 

[0032] These purposes, features and advantages of 
preferred, non-limiting, embodiments of the present in- 
vention will be described by way of examples with ref- 
erence to the accompanying drawings, in which: 

FIG. 1 schematically represents the main elements 
of one exemplary embodiment of the system ac- 
cording the present invention; 
FIG. 2 schematically represents the main data base 
tables of the system according to one exemplary 
embodiment of the present invention; 
FIG. 3 is a flowchart of the main steps of one exem- 
plary embodiment of the method according to the 
present invention; and 

FIG. 4 is a more detailed flowchart of the steps of 
processing a document according to the present in- 
vention. 

[0033] For clarity, only those elements and steps use- 
ful to the understanding of the invention have been 
shown in the drawings. Especially, details of the pro- 
gramming steps according to the system of the invention 
will not be detailed as it will readily occur to those skilled 
in the art. 

Detailed Description 

[0034] In the present specification: 

"Subjects" means words or groups of words consti- 
tuting descriptive properties of the object to be an- 
alyzed, i.e. able to select (and differentiate) the con- 
cerned object from the others; for example, applied 
to a product, the subjects may comprise the trade 
name of the product (eventually associated with the 
name of the manufacturer for a generic name) and 
its abbreviations if any, and the trade name of the 
manufacturer or seller and its trademarks or abbre- 
viations if any; according to the present invention, 
the subjects are classified into "key subjects" and 
"anti-key subjects", that is subjects identifying pos- 



itively the object for which one should evaluate a 
satisfaction index and subjects which should be ex- 
cluded for further evaluation; and 
"Concepts" means words or groups of words con- 
5 stituting an appreciation of an object and which 
could be classified into positive or negative opin- 
ions. 

[0035] With reference to FIG. t , one exemplary em- 

10 bodiment of a satisfaction index determination system 
according to the principles of the present invention in- 
cludes a general purpose computer (i.e. a general pur- 
pose personal computer, or networked server or mini- 
computer) having a central processing unit 10 (CPU) 

15 and including counter(s) and storage memories. The 
system also comprises standard input peripherals 1 2-IN- 
PUT PERIPHERALS (such as a keyboard, a mouse, a 
scanner, a floppy disk and/or CD reader, etc.) and stand- 
ard output peripherals 14-OUTPUT PERIPHERALS 

20 (such as a screen, a printer, etc.). The system further 
includes a data base 3-DATA BASE (or any memory 
storage element) for receiving and storing, inter alia, text 
documents to be processed. These documents are, in 
the exemplary embodiment of FIG. 1 , downloaded from 

25 the web 2 (INTERNET). The computer also comprises 
a communication device for communicating with the 
memory storage(s) (for example, the data base) which 
could be local or remote. 

[0036] The texts or documents to be processed ac- 
30. cording to the present invention may have been select- 
ed and stored (for example, downloaded from the web) 
before the satisfaction index determination processing. 
Alternatively, the determination processing may be 
. processed on-line, the web being then considered as a 
35 remote memory storage. Any intermediate embodiment 
may be considered. 

[0037] According to the present invention, the docu- 
ments to be processed are selected on the basis of de- 
scriptive properties of the object (product, service, 

40 brand, event, etc.) for which one wants to evaluate an 
satisfaction index. The descriptive properties are words 
or group of words identifying the object, so that when 
such a word or group of words appears in a text, the text 
has to be considered for its document satisfaction index 

45 according to the invention. In the preferred embodiment 
of the present invention the descriptive properties are 
classified into key subjects (key words or groups of 
words) and anti-key subjects (anti-key words or group 
of words). 

50 [0038] The invention may use any classical keyword 
search provided that, for the preferred embodiment, it 
also includes an anti-keyword functionality to preferably 
exclude some sentences to be evaluated in the proc- 
essed documents. Conventionally, the keyword search 

55 tool performs the search on the basis of words identify- 
ing the object (i.e. the name(s) of the object and its syn- 
onyms). 

[0039] According to the present invention, the docu- 
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ments are subdivided into windows of words. Preferably, 
a window corresponds to a sentence or a segment of 
sentence. The documents are then processed sentence 
after sentence for evaluating the number of occurrences 
of positive or negative concepts determining the score 5 
of a sentence, that is a positive or negative opinion. Fur- 
ther, the score of a sentence is taken into account in the 
score of the document corresponding to the number of 
occurrences of positive or negative sentences only if the 
sentence refers to a key subject and/or does not refer 10 
to an anti-key subject. 

[0040] The data base 3 also contains the tables (lists) 
used by the method for determining a satisfaction index 
according to the invention. 

[0041] FIG. 2 shows, very schematically, an example 1$ 
of architecture of tables stored in the data base and/or 
in the computer memory according to a preferred em- 
bodiment of the present invention. The tables are sym- 
bolized by blocks. The data base contains: 

20 

- a table 31 (TEXT DB) containing the texts to be 
processed; 

a table 32 of key subjects and/or anti-key subjects 
(KEY-ANTIKEY SUBJECTS) containing lists of de- 
scriptive properties of the object for which one 25 
wants to obtain a satisfaction index according to the 
invention (table 32 contains for example, names 
identifying the objects and their synonyms and 
eventually the competitors); and 

- a concept table 33 (CONCEPTS) listing at least 30 
positive (>0) and negative (<0) concepts relating to 
the object to be analyzed. 

[0042] In the preferred embodiment of FIG. 2, the data 
base 3 also contains, in particular, one or more of the 35 
following tables: 

a table 34 (WORDS) containing lists of specific 
words, expressions or signs to optimize the analysis 
of each document (especially, a list of previous *o 
negators (PREVIOUS NEGATORS) and a list of 
sentence separators (SEPARATORS) and punctu- 
ation signs, etc.); a previous (sentence) negator is 
a word which inverts the satisfaction opinion of a 
previous sentence or segment of sentence. For ex- 45 
ample, the word "but" in the sentences "Hamburger 
are great. But hot-dogs are better" or in the sen- 
tence "Hamburgers are great but worse than hot- 
dogs" inverts the evaluation of the hamburgers in 
the first sentence or in the first segment; so 

- a table 35 of generic lists (GENERIC LISTS) con- 
taining words or expressions relating to the activity 
domain (for example, the industries (INDUS- 
TRIES), the manufacturers (MANUFACTURERS), 

the markets (MARKETS), etc.); these lists may also 55 
adapt the sense of a word according to the activity 
domain; 

a table 36 of user lists (USER LISTS) containing 



lists dedicated to the users of the system (for exam- 
ple, lists of competitors (COMPETITORS), products 
(PRODUCTS), etc.) and which, most often, special- 
izes the generic lists; 

a method table 37 (METHOD) containing the differ- 
ent options for processing the documents (for ex- 
ample, selection of sentences by keywords or ex- 
clusion by anti-keywords); and 
an index table 38 (DOC. SCORES) storing the doc- 
ument satisfaction scores of each processed docu- 
ment (object satisfaction score or index for a docu- 
ment corresponding to the number of occurrences 
of positive or negative concepts) for various purpos- 
es (for example, to perform a manual detailed anal- 
ysis of some documents to determine the reasons 
of a negative judgment on an object). 

[0043] In FIG. 2, the main functional relations be- 
tween the different tables are indicated. This corre- 
sponds only to an example and the relations or links be- 
tween the different tables of a data base according to 
the invention depend on the choices (programming) 
made for implementing the invention. For example, the 
list of competitors may be considered as a partial or an 
additional list of anti-key subjects. Those skilled in these 
arts will recognize that additional tables or different ta- 
bles may be used to accomplish the purposes of the in- 
vention. 

[0044] The system of the present invention also com- 
prises tools to review the objects description properties 
and/or the positive and negative concepts and/or the 
document satisfaction scores. Such tools allow collect- 
ing information about the processed documents and 
meta data associated to them, and adapting the system 
to a particular field of use. 

[0045] FIG. 3 is a flowchart of the main steps of the 
method according to an exemplary embodiment of the 
present invention. 

[0046] At block 41 (START), the system is initialized 
and configured according to the processing method se- 
lected. According to the present invention, at least one 
of the following configuration choices can be made : 

Selecting the kind of sentences to be taken into ac- 
count for determining the score of the document. 
One can either take into account only the sentences 
containing one occurrence or more of at least one 
of the key subjects, or only the sentences not con- 
taining any anti-key subject, or only the sentences 
containing at least one occurrence or more of one 
of the key subjects without containing any anti-key 
subject. For example, if the object is a car of a man- 
ufacturer X, the words "car" and "X" may be key- 
subjects, and the words "plane" and "Y" (a compet- 
itor) may be anti-key subjects. Supposing the text: 
"The cars manufactured by X are good. The planes 
manufactured by X are good. The cars manufac- 
tured by X and customized by Y are bad. The plane 
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manufactured by Y are good. The bicycles manu- 
factured by Z are good." According to the method 
selecting only the sentences containing key sub- 
jects, only the first three sentences will be selected. 
According to the method selecting only the sentenc- 
es not containing an anti-key subject, only the first 
two and last sentences will be selected. According 
to the method combining key and anti-key subjects, 
only the first sentence will be selected. 
Determining a comparison threshold, that is a 
threshold of number of occurrences of competitors 
from which a document is considered as referring 
to another object. It is used, for example, to elimi- 
nate a document relating to the product of a com- 
petitor and in which the object to be evaluated is 
cited as being compared to one of a competitor. 
With the foregoing example, the document will be 
taken into account if the threshold is higher than 2. 
Determining a satisfaction threshold in order to con- 
sider as opinionless a document in which the differ- 
ence (in absolute value) between the numbers of 
occurrences of positive and negative concepts is 
too low. For example, the satisfaction threshold can 
be 2. Thus, if a document contains 8 positive sen- 
tences and 7 negative sentences, it will be consid- 
ered as opinionless. 

Selecting whether the negation particles are to be 
processed only with respect to the current sentence 
or also as an eventual previous negator. 

[0047] At block 42 (SELECT DOC), the documents 
(texts) contained in the data base (table 31, FIG. 2) to 
be processed are sequentially selected on the basis of 
the occurrence of a key subject. 
[0048] The selected documents are sequentially 
processed (block 43, DOC. PROCESSING) to deter- 
mine the document score. Step 43 results in a document 
score (DOC. SCORE) which is stored at block 44. 
[0049] The document scores are summed in block 45 
(ADD TO OBJECT SATISF. INDEX) to provide an object 
score which results in the object satisfaction index. Ac- 
cording to a preferred embodiment, the corresponding 
document is stored with its satisfaction index. 
[0050] Then, if all the documents have not yet been 
processed (block 47, END OF BASE?), the next docu- 
ment is selected (block 46, NEXT) and processed. 
When all the documents have been processed, results 
are outputted (block 48, END), for example, in the form 
of graphics displayed or printed. Various kinds of graph- 
ic presentations could be provided depending on the in- 
formation wanted by the user. The presentations of the 
processed information are within the ability of those 
skilled in the art. In particular, intermediate presenta- 
tions can be displayed during processing. 
[0051] FIG. 4 shows a more detailed flowchart of 
steps 43 to 45 of the preferred embodiment of the 
present invention. For clarity, the corresponding algo- 
rithm will be described considering that subjects and 



concepts are only words. However, in practice, subjects 
and concepts may be groups of words and the adjust- 
ments of the algorithm are in the ability of those skilled 
in the art. 

5 [0052] Each selected document (block 42, FIG. 3) is 
processed, sentence by sentence. Preferably, the doc- 
uments are processed at the same time on the basis of 
both key subjects (and/or anti-key subjects) and posi- 
tive/negative (or neutral) concepts. 

10 [0053] While the words belong to the same sentence 
or segment of sentence (no punctuation, conjunction, or 
special character as defined in table 34), the following 
detections and counts (block 51, WORD COUNTER) 
are sequentially performed for each word of the sen- 

15 tence: 

counting the number of occurrences of positive con- 
cepts (good, best, useful, pleasant, etc.) in the sen- 
tence, that is sequentially incrementing a positive 
20 score of the sentence when the current word is a 
positive concept; 

counting the number of occurrences of negative 
concepts (bad, worst, useless, unpleasant, etc.) in 
the sentence, that is sequentially incrementing a 
25 negative score of the sentence when the current 
word is a negative concept; 
counting the number of occurrences of key words 
in the sentence; 

counting the number of occurrences of anti-key 

30 words in the sentence; 

counting the number of occurrences of competitor 
words in the sentence (the number of competitor 
words may correspond to the number of anti-key 
words but has to be cumulated along the whole doc- 

35 ument); and 

counting the number of negators (negation parti- 
cles) in the sentence. 

[0054] If the first word of a new sentence or segment 
40 of sentence (block 52, PN?) is a previous negator ("bur, 
"however*, etc.), then the previous sentence has to be 
ignored. In practice it is sufficient to store the score of a 
sentence up to the processing of the following one and 
to, if necessary, decrement the score of the document 
45 of the previous sentence score (block 53, DECREMENT 
PREVIOUS). 

[0055] At the end of the sentence or segment tested 
at block 54 (END OF SEGM?) the system decides 
whether or not there is a next word (block 55, NEXT 
50 WORD) to process in step 51 . If not, then one processes 
the following tests and counts : 

if the number of negation is an odd value (block 56, 
NEG=ODD?), then one exchanges the values of the 
55 positive and negative scores of the sentence (block 
57, EXCH.>0, <0); 

if the number of key words is equal to 0 or if the 
number of anti-key words is not equal to 0 (block 
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58, KEY ANTIKEY?), then the negative and positive 
scores of the sentence are both forced to 0 (block 

59, SEGSCORES=0); if not, at block 60 (SATIS- 
FACTION COUNTER) , if the positive score of the 
sentence is higher than its negative score, then one 
increments by 1 the positive score of the document, 
else if the negative score of the sentence is higher 
than its positive score, one increments by 1 the neg- 
ative score of the document, else the sentence is 
considered as opinionless; 

- adding (block 61, COMPETITOR COUNTER.) the 
number of occurrences of competitors in the sen- 
tence to the total number of competitors; 

- storing (block 62, STORE SCORES) the opinion 
(positive, negative or opinionless) of the current 
sentence as being the opinion if the previous sen- 
tence (for the previous sentence negator function) 
before initializing the sentence scores for the next 
sentence. 

[0056] At the end of the document tested at block 63 
(END OF DOC?) the system decides whether or not 
there is a next sentence or segment (block 64, NEXT 
SEG/SENT) to process from step 51 . If not, then one 
performs the following tests and counts: 

if the total number of competitors is higher than the 
comparison threshold (block 65, COMP?), then the 
document is not considered in the satisfaction index 
and the eventual next document is processed 
(block 47. FIG. 3); else, 

if the difference between the positive and negative 
scores of the document is lower than the satisfac- 
tion threshold (block 66, SATISF?), then the docu- 
ment is considered as opinionless, else the positive 
or negative index of the object is incremented by 1 
(block 67, INDEX COUNTER) depending on the 
score of the document. 

[0057] Various alterations or modifications can be 
provided alone or in combination. Among others: 

the number of references to competitors may be ob- 
tained either by considering the occurrences of anti- 
key subjects as competitor occurrences or by 
counting separately the occurrences of competi- 
tors; 

one can force the scores of the document to 0 if the 
difference between its positive and negative scores 
is lower than a satisfaction threshold; to obtain the 
opinionless score of the object, one could either add 
the number of documents having both negative and 
positive scores at 0, or calculate the difference be- 
tween the number of processed documents and the 
sum of positive and negative scores; and 
the comparison threshold can concern the differ- 
ence (in absolute value) between the occurrences 
of key subjects and anti-key subjects in the docu- 



ment. 

[0058] Other adjustments or modifications may be 
provided. For example, one could accept that the cur- 
5 rent word to be processed might not exactly correspond 
to one found in the lists, but contain it. One might also 
first convert each word to the lower case to simplify the 
word search. Further, the search engine(s) used in the 
invention for determining if a word is present or not in a 
list of words is(are) conventional. 
[0059] An advantage of the present invention is that 
processing the documents with lists of positive con- 
cepts, negative concepts and descriptive properties is 
faster than a manual evaluation and simplifies the con- 
figuration of the system with respect to an automatic 
evaluation based on artificial neural networks. 
[0060] Another advantage of the present invention is 
that it gives better results of the satisfaction index than 
the known automatic methods. Especially, processing 
each document sentence after sentence allows taking 
into account the context of a positive or negative con- 
cept. 

[0061] Another advantage is that the possibility to ex- 
clude sentences (and documents) containing anti-key 
subjects and/or to only take into account sentences con- 
taining key-subject(s) improves the viability of the eval- 
uation without notably impairing the simplicity of the sys- 
tem and the speed of the processing. 
[0062] Another advantage is that processing the doc- 
ument for the key subjects and/or anti-key subjects and 
for the positive and/or negative concepts during the 
same run allows determining both if a document is to be 
taken into account and the satisfaction index of said doc- 
ument within a single sequentially processing pass of 
said document. Further, it simplifies the program. In par- 
ticular, it allows combining the results only by multiplying 
or adding the selection results (key, anti-key) to the sat- 
isfaction results (positive, negative). 
[0063] Another advantage is that allowing ignoring the 
sentences not comprising a key subject renders the sys- 
tem of the present invention able to evaluate correctly 
the satisfaction index of a text where some descriptive 
concepts could be confused with appreciation concepts. 
Indeed, a classification in negative or positive words 
may not always be sufficient to obtain a correct evalua- 
tion of a document satisfaction index. For example, ap- 
plied to a movie, without ignoring the sentences only de- 
scribing the movie, such a classification would give a 
negative index for a thriller and a positive index for an 
humor movie, whatever would be the real judgment. 
Such a classification would come from the fact that the 
text describing the movie would contain words to be con- 
sidered as negative, respectively positive, in the context 
of an appreciation. For example, supposing that "acting" 
is a key words and that the text is: The acting is great. 
The monster was awful**. Processing every sentence will 
lead to a opinionless score ("great" is positive, "awful" 
is negative). Only taking into account the sentences 
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comprising a key subject allows ignoring the descriptive 
sentence and leads to a correct positive score of the 
document. 

[0064] Another advantage is that the invention pro- 
vides a very simple method to solve the difficulties in 
interpreting a sentence with a previous sentence nega- 
tor. 

[0065] The configuration options offered to the user 
may be modified and adapted to the field of use. Imple- 
menting the invention with conventional programming 
techniques and devices, is in the ability of those skilled 
in the art. The documents to be processed may be pre- 
viously selected from the data base on the basis of other 
selection elements. For example, one may use a con- 
ventional search engine for pre-selecting a group of doc- 
uments from the web. 

[0066] The above description has been made with ref- 
erence to sentences or segments of sentence. Howev- 
er, more generally, one can divide the text into windows 
of words which do not correspond to sentences. For ex- 
ample, a window can correspond to a predetermined 
number of words foregoing and following each occur- 
rence of a key subject. The document can also be sub- 
divided into successive windows containing each the 
same predetermined number of words. The two forego- 
ing kinds of windows can be combined with anti-key win- 
dows corresponding to a predetermined number of 
words foregoing and following each occurrence of an 
anti-key subject. 

[0067] Having thus described at least one illustrative 
embodiment of the invention, various alterations, mod- 
ifications, and improvements will readily occur to those 
skilled in the art. Such alterations, modifications, and im- 
provements are intended to be within the spirit and 
scope of the invention. Accordingly, the foregoing de- 
scription is by way of example only and is not intended 
to be limiting. The invention is limited only as defined in 
the following claims and the equivalent thereto. 



Claims 

1 . A computer based method for automatically deter- 
mining in a document an object satisfaction score 
(44), choosing at least one of either a positive score 
or a negative score, of an object such as a product, 
service, brand, event or similar, comprising the 
steps of: 

subdividing the document into windows of 
words; 

selecting one of said windows when it contains 
at least one descriptive property corresponding 
to a key subject of said object and/or when it 
does not contain any descriptive property cor- 
responding to an anti-key subject of said object; 
processing each selected window to determine 
a positive or negative score of said window by 



counting (51) the occurrences of positive and 
negative concepts; and 
cumulating (60) the scores of the selected win- 
dows to obtain said object satisfaction score for 
5 the document. 

2. The method of claim 1 , further comprising the steps 
of: 

10 - storing in a computer memory a list of positive 
concepts (33) concerning said object; 
storing in said computer memory a list of neg- 
ative concepts (33) concerning said object; 
storing in said computer memory a list of de- 

*5 scriptive properties (32, 35, 36) corresponding 

to key subjects of said object; and 
storing in said computer memory a list of de- 
scriptive properties (32, 35, 36) corresponding 
to anti-key subjects of said object. 

20 

3. The method of claim 1 , further comprising the steps 
of: 

counting (51 ) the number of occurrences of ne- 
25 gation particles in the current selected window; 

and 

inverting (57) the positive and negative scores 
of the current window if said number is odd (56). 

30 4. The method of claim 1 , wherein said windows of 
words are sentences or segments of sentence. 

5. The method of claim 4, wherein the positive or neg- 
ative score of a previous sentence is subtracted 
35 (53) from the document corresponding score when 
a word considered as a sentence separator and 
able to constitute a negator of the previous sen- 
tence appears in the current sentence (52). 

ie method of claim 5, also comprising the step of 
oring in said computer memory a list of previous 
jgators (34). 

computer based method for automatically deter- 
ining an object satisfaction index of an object such 
> a product, service, brand, event or similar, 
nong documents contained in at least one local or 
mote memory storage (2, 3) comprising the steps 
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selecting (42) from the memory storage docu- 
ments relating to said object; 
processing (43) the selected documents ac- 
cording to anyone of claims 1 to 6; and 
55 - cumulating (45) the scores of the selected doc- 
uments and deducing from obtained cumulated 
scores said object satisfaction index. 
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8. The method of claim 7, wherein the selection of a 
document is based on the occurrences of said key 
subjects and/or anti-key subjects counted (51) dur- 
ing said processing step of each window. 

9. The method of claim 8, wherein a document score 
is added (67) to the corresponding cumulated score 
only if the absolute value of the difference between 
the positive and negative scores of this document 
is higher than a predetermined satisfaction thresh- 
old (66), or is otherwise considered as opinionless. 

10. The method of claim 8, further comprising counting 
(51, 61) the number of occurrences of competitor 
subjects in each document and wherein a document 
score is added to the corresponding cumulated 
score only if said number of competitor subject oc- 
currences is lower than a predetermined compari- 
son threshold (65). 

11. The method of claim 10, wherein the competitor 
subjects are part of the anti-key subjects and are 
counted (51) during said processing step of each 
selected window. 

12. A computer based system for automatically deter- 
mining an object satisfaction index for an object 
such as a product, service, brand, event or similar 
comprising: 

a general purpose computer having at least an 
output device (14), at least an input device (12), 
at least a communication device for communi- 
cating with at least one local and/or remote doc- 
ument memory storage (2, 3), and a central 
processing unit (10), 
said central processing unit comprising: 

at least a computer memory for storing at 
least a first list of positive concepts (33) 
concerning said object, a second list of 
negative concepts (33) concerning said 
object, and a third list of descriptive prop- 
erties (32, 35, 36) of said object, 
a counter for counting, in windows of words 
of a selected document, the occurrences 
of positive and/or negative concepts, for 
cumulating the scores of said windows to 
obtain a satisfaction score of said object in 
the document, a window being taken into 
account in said satisfaction score only if it 
contains at least one descriptive property 
corresponding to a key subject of said ob- 
ject and/or if it does not contain any de- 
scriptive property corresponding to an anti- 
key subject of said object. 

13. The computer based system of claim 12, wherein 



said counter cumulates the satisfaction scores of 
selected documents of the memory storage to de- 
termine a satisfaction index of said object. 

5 14. The computer based system of claim 12, further 
comprising a search engine for selecting docu- 
ments on the basis of said descriptive properties. 

15. The computer based system of claim 12, wherein 
10 said counter also counts the occurrences of said de- 
scriptive properties. 

16. The computer based system of claim 12, further 
comprising a data table (34) to store configuration 

15 choices. 

17. The computer based system of claim 12, further 
comprising a list of previous negators so that, when 
such a previous negator appears in a current win- 

20 dow, the positive or negative score of the previous 
window is disregarded. 

18. The computer based system of claim 12, further 
comprising tools to review the objects description 

25 properties and/or the positive and negative con- 
cepts and/or the document satisfaction scores, and/ 
or the base document or meta data. 

19. An automatic system for determining a satisfaction 
index of an object such as a product, service, brand, 
event or similar, among a group of documents con- 
tained in at least one local or remote memory stor- 
age (2, 3), characterized in that it comprises 
means for implementing the method according to 
any of claim 7 to 11. 

20. A computer program product implementing the 
method according to any of claims 1 to 11. 
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